<i>K</i>-fold cross-validation for complex sample surveys
Authors
Abstract
Although K-fold cross-validation (CV) is widely used for model evaluation and selection, there has been limited understanding of how to perform CV for non-iid data, including those from sampling designs with unequal selection probabilities. We introduce CV methodology that is appropriate for design-based inference from complex survey designs. For such data, we claim that we will tend to make better inferences when we choose the folds and compute the test errors in ways that account for design features such as stratification and clustering. Our mathematical arguments are supported by simulations, and our methods are illustrated on real data.
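The idea in the abstract of choosing folds and computing test errors in ways that respect the design can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `survey_folds` and `weighted_cv_mse`, the toy data, and the choice of a weighted-mean "model" are all hypothetical. Whole clusters are kept together in one fold, clusters are spread across folds within each stratum, and the test error is weighted by the sampling weights.

```python
import random
from collections import defaultdict


def survey_folds(strata, clusters, k, seed=0):
    """Return a fold id (0..k-1) per observation.

    Every observation in a cluster gets the same fold, and the clusters
    of each stratum are dealt out across the k folds as evenly as possible.
    """
    rng = random.Random(seed)
    by_stratum = defaultdict(set)
    for s, c in zip(strata, clusters):
        by_stratum[s].add(c)
    fold_of = {}
    for s in sorted(by_stratum):
        psus = sorted(by_stratum[s])
        rng.shuffle(psus)
        offset = rng.randrange(k)  # vary which fold receives the first cluster
        for i, c in enumerate(psus):
            fold_of[(s, c)] = (i + offset) % k
    return [fold_of[(s, c)] for s, c in zip(strata, clusters)]


def weighted_cv_mse(y, weights, folds, k):
    """Sampling-weighted CV mean squared error of a weighted-mean predictor."""
    sq_err = 0.0
    w_total = 0.0
    for f in range(k):
        train = [i for i, g in enumerate(folds) if g != f]
        test = [i for i, g in enumerate(folds) if g == f]
        if not train or not test:
            continue  # skip folds that cannot be fitted or scored
        # "Fit": the weighted mean of the training responses.
        w_train = sum(weights[i] for i in train)
        pred = sum(weights[i] * y[i] for i in train) / w_train
        # "Score": squared errors weighted by the sampling weights.
        for i in test:
            sq_err += weights[i] * (y[i] - pred) ** 2
            w_total += weights[i]
    return sq_err / w_total


# Toy data (made up): 2 strata, 3 clusters each, 2 observations per cluster.
strata = ["A"] * 6 + ["B"] * 6
clusters = [1, 1, 2, 2, 3, 3, 4, 4, 5, 5, 6, 6]
weights = [10.0] * 6 + [2.0] * 6
y = [3.1, 2.9, 3.4, 3.6, 2.7, 3.0, 5.2, 5.0, 4.8, 5.1, 5.5, 5.3]

folds = survey_folds(strata, clusters, k=3)
print(folds)
print(weighted_cv_mse(y, weights, folds, k=3))
```

Replacing the weighted mean with any model that accepts observation weights keeps the same structure: fit on the training folds with the weights, then average the weighted losses on the held-out fold.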
Related resources
Sample Surveys
A census is a complete enumeration of the population: data are collected from every unit in the population. In a survey, a subset of the population, called a sample, is taken. Censuses are taken at regular but infrequent intervals, e.g. every 5 or 10 years. In between, surveys are used to update results. The selection and estimation procedures of official surveys are almost always based on previo...
Sample Surveys
1. What is a Survey? 2. Probability sampling 3. Common probability sampling designs 3.1. Simple Random Sampling 3.2. Stratified Sampling 3.3. Cluster Sampling 3.4. Unequal Probability Sampling 3.5. Systematic Sampling 3.6. Stratified Multistage Sampling 4. Survey estimates and standard errors 5. Nonsampling errors 6. Sampling rare populations 7. Issues in Survey Design Acknowledgments Glossary ...
Resampling Methods for Sample Surveys
Application of resampling methods in sample survey settings presents considerable practical and conceptual difficulties. Various potential solutions have recently been proffered in the statistical literature. This paper provides a brief critical review of these methods. Our main conclusion is that, while resampling methods may be useful in some problems, there is little evidence of their useful...
Methods for Extreme Weights in Sample Surveys
In survey sampling practice, planned and unplanned variation in the sampling weights can result in inflated sampling variances. As a result, extreme sampling weights are sometimes trimmed to reduce the sampling variance. However, when sampling weights are trimmed, a bias can be introduced into the survey estimates. The goal of sampling weight trimming is to reduce the sampling variance while av...
Journal
Journal title: Stat
Year: 2022
ISSN: 2049-1573
DOI: https://doi.org/10.1002/sta4.454